Developing a Technology Allowing (Semi-) automatic Interpretative Transcription

نویسندگان

  • Daniela Gîfu
  • Mihaela Onofrei
چکیده

This paper responds to the great interest to humanities researchers who are concerned with the study of the Romanian language in its diachronic evolution: developing a set of tools allowing (semi-)automatic interpretative transcription of scanned Romanian documents written in Cyrillic, in print as well as manuscript forms. The corpus contains old data, belonging to the 19th20th centuries, in order to develop an automatic recognition and interpretative transcription of Romanian historical newspapers from Cyrillic (Cy) into Latin (La), in both manuscript and printed forms. We think that the present study will have an important impact the humanities research, including that of paleography, history, archaeology and that field of linguistics interested in the study of the language in diachrony, but it will also help the researchers in the field of computational linguistics that develops models for old language, in order to elaborate a diachronic POS tagger so necessary to recover old lemmata.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Challenges of the Digital Home in a Developing Economy

The Digital Home technology and standards are evolving towards an integrated electronic home system. This evolution will facilitate accessibility of data and information from various sources around the world to our home. The technology will also enable the automatic or semi-automatic control of lighting, climate doors and windows, and security, surveillance systems and control of other sundry d...

متن کامل

Bilevel Sparse Models for Polyphonic Music Transcription

In this work, we propose a trainable sparse model for automatic polyphonic music transcription, which incorporates several successful approaches into a unified optimization framework. Our model combines unsupervised synthesis models similar to latent component analysis and nonnegative factorization with metric learning techniques that allow supervised discriminative learning. We develop efficie...

متن کامل

'Feel the Feeling': Psychological practitioners' experience of acceptance and commitment therapy well-being training in the workplace.

This empirical study investigates psychological practitioners' experience of worksite training in acceptance and commitment therapy using an interpretative phenomenological analysis methodology. Semi-structured interviews were conducted with eight participants, and three themes emerged from the interpretative phenomenological analysis data analysis: influence of previous experiences, self and o...

متن کامل

A computer system for processing data from routine pulmonary function tests.

In larger pulmonary function laboratories there is a need for computerised techniques of data processing. A flexible computer system, which is used routinely, is described. The system processes data from a relatively large range of tests. Two types of output are produced--one for laboratory purposes, and one for return to the referring physician. The system adds an automatic interpretative repo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017